Learning Procedural Planning Knowledge in Complex Environments

نویسنده

  • Douglas J. Pearson
چکیده

LEARNING PROCEDURAL PLANNING KNOWLEDGE IN COMPLEX ENVIRONMENTS by Douglas John Pearson Chair: John E. Laird In complex, dynamic environments, an agent's knowledge of the environment (its domain knowledge) will rarely be complete and correct. Existing approaches to learning and correcting domain knowledge have focused on either learning procedural knowledge to directly guide execution (e.g. reinforcement learners) or learning declarative planning knowledge (e.g. theory revision systems). Systems that only learn execution knowledge are generally only applicable to small domains. In these domains it is possible to learn an execution policy that covers the entire state space, making planning unnecessary. Conversely, existing approaches to learning declarative planning knowledge are applicable to large domains, but they are limited to simple agents, where actions produce immediate, deterministic e ects in fully sensed, noise-free environments, and where there are no exogenous events. This research investigates the use of procedural knowledge to support the learning of planning knowledge in large and complex environments. We describe a series of environmental properties that constrain learning and are violated by existing approaches to learning planning knowledge. We then present an operator-based representation for planning knowledge that is su ciently expressive to model complex, conditional actions that produce sequential e ects over time. We then present IMPROV, a system for learning and correcting errors in this planning knowledge that only requires procedural access to the knowledge. This procedural restriction ensures that learning remains tractable, even over these large, expressive representations. We rst explain how IMPROV incrementally learns operator precondition knowledge. We then demonstrate how a hierarchical, operator-based representation can be used to reduce the problem of learning operator e ects to the problem of learning operator preconditions. This result allows IMPROV to use a single learning method to learn both operator preconditions and e ects. This also allows IMPROV to learn complex models of actions that produce conditional or sequential e ects. Finally, we test the system in two sample domains and empirically demonstrate that it satis es many of the constraints faced by learning agents in complex and challenging environments. c Douglas John Pearson 1996 All Rights Reserved To Helen For each and every, perfectly ordinary and perfectly wonderful day.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Incremental Learning of Procedural Planning Knowledge in Challenging Environments

Autonomous agents that learn about their environment can be divided into two broad classes. One class of existing learners, reinforcement learners, typically employ weak learning methods to directly modify an agent’s execution knowledge. These systems are robust in dynamic and complex environments but generally do not support planning or the pursuit of multiple goals. In contrast, symbolic theo...

متن کامل

Scaffolding: A way for supporting learners in e-learning environments

Introduction: One of the effective ways to help learners improve their learning in learning environments is scaffolding. Scaffolding can be defined as teachers, other learners and resources support of learner on tasks that can’t be done alone. Experts strongly demand scaffolding to be used to help learners on abstractive and complex issues. The aim of this study is to examine the scaffoldi...

متن کامل

The conflict between conceptual and procedural knowledge: Should we need to understand in order to be able to do, or vice versa?

The question in the title is the major foci when planning any learning environments. Basically it means that we have to understand how procedural knowledge and conceptual knowledge relate to each other. It seems appropriate to underline that these two types of knowledge must be somehow related when the learning process is our focus. However, it is the variables in the assessment of this process...

متن کامل

A Comparison of Expert and Novice Iranian EFL Teachers’ Procedural Knowledge in Iranian Language Institutes and Universities

This study sought to compare Iranian EFL novice and expert teachers regarding their procedural knowledge in Iranian language institutes and universities. A questionnaire was developed based on the literature, the theoretical framework, and the results of a qualitative study. This questionnaire was administered to the whole sample of the study who was 200 Iranian EFL teachers from different gend...

متن کامل

earning roeedura

Autonomous agents functioning in complex and rapidly changing environments can improve their task performance if they update and correct their world model over the life of the agent. Existing research on this problem can be divided into two classes. First, reinforcement learners that use weak inductive methods to directly modify an agent’s procedural execution knowledge. These systems are robus...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1996